The rigorous solution for the average distance of a Sierpinski network 
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' The closed-form solution for the average distance of a deterministic network-Sierpinski network-is 

, found. This important quantity is calculated exactly with the help of recursion relations, which are 

based on the self-similar network structure and enable one to derive the precise formula analytically. 
The obtained rigorous solution confirms our previous numerical result, which shows that the average 
distance grows logarithmically with the number of network nodes. The result is at variance with 
that derived from random networks. 
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I. INTRODUCTION 



Structural properties [i[ , such as degree distribution [3] , average distance Q , degree correlations |4j] , community [1] , 
motifs [i| , fractality 0] , and symmetry Q , have received much attention in the field of complex networks, since these 
features play significant roles in characterizing and understanding complex networked systems in nature and socl- 
es ' ety. Among these important features, average distance characterizes the small-world behavior commonly observed in 
^ ■ various disparate real networks It has been established that average distance is related to other structural prop- 
I I erties, such as degree distribution [1, [13], fractality [ill [III and symmetry [l^. On the other hand, average distance 
■ O ' has an important consequence on dynamical processes taking placing on networks, including disease spreading 



^ nas an important consequence on aynamicai pre 
^ routing [l^ , robustness [ll|, [l3| , percolation 



attention for the scientific community [13, [13, [13, 



16 1, and so on. Thus far, average distance has become a focus of 
20|,|2l|,[2|. 

Using above-mentioned structural properties, extensive empirical studies on diverse real systems have been done 
with an attempt to uncover and understand the generic features and complexity of these systems, and various network 
■ models have been proposed to reproduce or explaining the common characteristics of real- life networks [23,, 2A, 2Il^ 2^ ■ 
. Recently, inspired by the well-known Sierpinski gasket, we proposed a novel network, called Sierpinski network [27|. 
The Sierpinski network belongs to a deterministically growing class of networks that have attracted considerable 
attention and have turned out to be a useful tool 28, 29, 30, 31, 32, 33, JM]. Many relevant topological properties of 



Sierpinski network such as degree distribution, clustering coefficient, and strength distribution have been determined 



analytically |27|. Also, the average distance of Sierpinski network has been been inve stig ated numerically, which was 
shown to behaves a logarithmic scaling with the number of network nodes (vertices) [23 • 
\ In view of the importance and usefulness of the quantity — average distance, here we derive a closed-form formula 
. for the average distance characterizing the Sierpinski network. The analytic method is on the basis of the recursive 
' construction and self-similar structure of Sierpinski network. Our precise result shows that the average distance of 
• i-H : Sierpinski network increases logarithmically with the number of nodes. This scaling behaves differently from that of 
random networks [3, [l3| ■ Our rigorous solution confirms the scaling between average distance and number of network 
^ J nodes that was previously obtained numerically in [27| . 

II. BRIEF INTRODUCTION TO THE SIERPINSKI NETWORK 

The Sierpinski network is derived from a variation of the Sierpinski gasket [35| . The Sierpinski gasket variant, shown 
in figure[l] is constructed as follows [36] . We start with an equilateral triangle, and we denote this initial configuration 
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FIG. 1: The first two stages in tiie construction for a variant of Sierpinkski gasket. 




FIG. 2: Illustration of the network corresponding to the fractal shown in figure [T] 

by generation t = Q. Then in the first generation t = 1, we divide the three sides of the equilateral triangle in three, 
then join these points and remove the three down pointing triangles. This forms six copies of the original triangle, 
and the procedure is repeated indefinitely for all the new copies. In the limit of infinite t generations, we get a fractal 
variant of the Sierpinski gasket. The Hausdorff dimension of the obtained fractal is df = 1 + In 2/ In 3 [37[. From 
the fractal, one can define the Sierpinski network [27| . where vertices correspond to the removed triangles and two 
vertices are connected if the boundaries of the corresponding triangles contact each other. Note that for uniformity, 
the three sides of the initial equilateral triangle at step also correspond to three different vertices. Figure [2] shows 
the network construction process. 

The Sierpinski network can be generated using an iterative algorithm [23| . We denote the network after t iterations 
by Ft with t > 0. Then the network is constructed as follows: For t = 0, Ft is a triangle. Next, three nodes are added 
into the original triangle. These three new nodes are connected to each other forming a new triangle, and both ends 
of each edge of the new triangle are linked to a node of the original triangle. Thus Fi is obtained, see figure [3] For 
t > 1, we can get Ft from Ft-i. For each of the existing triangles of Ft-i that does not consist of three simultaneously 
emerging nodes and has never generated a node before, we define it an active triangle. We replace each of the active 
triangles in Ff_i by the connected cluster on the right hand of figure [3] to obtain Ft- 

The resulting network presents the typical characteristics of real-life networks in nature and society [l^l . It has 
power-law distributions of degree and strength, with exponents ■jk — 2 + and 7^ = | + -^f^ , respectively. There 
also exists a power-law scaling relation between the strength s and degree k of an individual node, i.e. s ^ k'^. On 
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the other hand, for any individual vertex, its clustering coefficient is C{k) = f — when k is large, C{k) is 

approximately inversely proportional to degree k. The mean value C of clustering coefficients of all vertices is very 
large, which asymptotically reaches a constant value 0.598. Moreover, the network is a maximal planar graph. 



III. RIGOROUS DERIVATION OF AVERAGE DISTANCE 



After introducing the Sierpinski network, we now derive analytically the average distance. We represent all the 
shortest path lengths of network Ft as a matrix in which the entry dij is the distance between node i and j that is 
the length of a shortest path joining i and j. A measure of the typical separation between two nodes in Ft is given 
by the average distance dt defined as the mean of distances over all pairs of nodes: 

, _ Dt 
- Nt{Nt - l)/2 ' 

where 

Dt= '^^1 (2) 

denotes the sum of the distances between two nodes over all couples. 



A. Recursive equation for total distances 



We continue by exhibiting the procedure of determining the total distance and present the recurrence formula, 
which allows us to obtain Dt+i of the t + 1 generation from Dt of the t generation. The Sierpinski network Ft has 
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a self-similar structure that allows one to calculate Dt analytically [3^. As shown in figure IH network Ft+i may be 

- t J ■ ' ■ ; 



obtained by joining at six edge nodes (i.e., A, B, C, X, Y , and Z) six copies of Ft that are labeled as F^^'^ 



Ft^^ [39| . From this we can obtain the recursion relation 

Nt+i^&Nt- 12, (3) 

for the network order Nt, which is the number of nodes in the graph of generation t. This recursion, coupled with 
TVo = 3, yields 

iv, . iil4±iH (4) 

as previously obtained in j27| . 

According to the second construction method, the total distance Dt+i satisfies the recursion relation 

A+i = 6A + At-6, (5) 

where A( is the sum over all shortest path length whose endpoints are not in the same f/'''' branch. The last term 
-6 on the right-hand side of Eq. ([5]) compensates for the overcounting of certain paths: the shortest path between A 
and X, with length 1, is included in both .f/^' and f/^-*. Similarly the shortest path between A and F, the shortest 
path between B and AT, the shortest path between B and Z, the shortest path between C and Y , and the shortest 
path between C and Z, are all computed twice. To determine Dt, all that is left is to calculate A^. 



B. Definition of crossing distance 



In order to compute At, we classify the nodes in Ft+i into two categories: the six edge nodes (such as A, B, C, 
X, Y, and Z in figure are called hub nodes, while the other nodes are named non-hub nodes. Thus At, named 
the crossing distance, can be obtained by summing the following path length that are not included in the distance 
of node pairs in Fj : length of the shortest paths between non-hub and non-hub nodes, length of the shortest paths 
between hub and non-hub nodes, and length of the shortest paths between hub nodes (i.e., cIaz, dsY, and dcz)- 

Denote A"'^ as the sum of all shortest paths between non-hub nodes, whose endpoints are in and Ff \ 

respectively. That is to say, A"'^ rules out the paths with endpoint at the hub nodes belonging to F^"^ or Ft^K For 
example, each path contributed to At' does not end at node A, B, X or Y. On the other hand, let be the set of 
non-hub nodes in Ff^K Then the total sum At is given by 

At = Al'^ + A]-"^ + Al'^ + Al-^ + At^'^ + Af'^ + Af * + Af'^ + A?'^ -|- Af * + Af^ + A?'^ + Af'^ 

+ A^^ + A^^ + J2 + + ^^j- + + '^Bi + Y 

jeof jen-' jens jso? jeo? jeof 

+ Y + Y '^cj + Y + Y '^^1 + Y '^^1 + Y '^^1 + Y + Y '^^j 

iGo; jen^ jen<^ jeo? jenf jeni jenl j&nl 

+ Y, '^Y] + Y, '^^■J' + Y^ + Y^ '^-'^^ ^ '^^^ '^^^ 

jen<i ]en\ jeni jenf 

By symmetry, At^'^ = A^'^ = A?'^ -t- A?'^ = A^^ = A^^ At^'^ = A^'^ = A? = A?-« = A?'^ ^ A^^ At^'^ = A?'^ = 

Yjenl ^Zj — J2jen^ ^Zj, and d^z ~ dsY = dcx ~ 2, so Eq. ^ can be simphfied as 

At = 6 Al'^ + 6 At^'^ + 3 At^'^ + 18 ^ d^j + 6. (7) 

Having At in terms of the quantities of Aj'^, A]'^ , Aj'*, and X)jgo* dAj, the next step is to explicitly determine these 
quantities. 



FIG. 5: Illustration of the classification of interior nodes in (77 — 1, ■ ■ ■ ,6), from which we can derive recursively the 

classification of interior nodes in network Ft+i. 



C. Classification of interior nodes 

To calculate the crossing distance Aj'^, A]'^, A^'**, and X^jen* '^^j^ classify interior nodes in network Ft+i into 
seven different parts according to their shortest path lengths to each of the three hub nodes (i.e. A, B, C) of the 
peripheral triangle AABC. Notice that nodes A, B, C themselves are not partitioned into any of the seven parts 
represented as Pi, P2, ^3, Pa, P5, Pq, and P7, respectively. The classification of nodes is shown in figure [5] For any 
interior node v, we denote the shortest path lengths from v to A, B, C as a, b, and c, respectively. By construction, 
a, b, c can differ by at most 1 since vertices A, B, C are adjacent. Then the classification function class{v) of node v 
is defined to be 



class(v) = < 



Pi 
P2 
P3 
Pi 
P5 
Pe 
P7 



for a <b = c, 
for b < a ~ c, 
for c < a = 6, 
for a — c < b^ 
for a — b < c, 
for b = c < a, 
for a = b = c. 



(8) 



.(1) 



It should be mentioned that the definition of node classification is recursive. For instance, class Pi and P4 in F^ 
belong to class Pi in Pt+i, class P3 and P5 in F^'' belong to class P2 in Pt+i, class P2, Pe, and P7 in p/^-* belong to 
class P5 in Pt+i. Since the three nodes A, B, and C are symmetrical, in the Sierpinski network we have the following 
equivalent relations from the viewpoint of class cardinality: classes Pi, P2, and P3 are equivalent to one another, and 
it is the same with classes P4, P5, and Pg. We denote the number of nodes in network Ft that belong to class Pi 
as N-t^p-^, the number of nodes in class P2 as Nt^p^, and so on. By symmetry, we have A^t.Pi — Nt,P2 — Nt^p^ and 
Nt^p^ — Nt^p^ — Nt^pg. Therefore in the following computation we will only consider Nt^p-^, Nt^p^, and Nt^p^. It is 
easy to conclude that 



Nt = Nt,p, + Nt,p., + Nt,p., + Nt,p^ + Nt,p, + Nt.Pg + Nt.p, + : 
= ZNt,P,+iNt.P,+Nt,P,+i. 



(9) 



Considering the self-similar structure of Sierpinski network, we can easily know that at time t + 1, the quantities 
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Nt+i.Pii -^t+i,P4, and Nt+i^p^ evolve according to the following recursive equations 

Nt+i,P, = 3 Nt.P, + 4 Nt,P^ + Nt.P, , 

iVt+i,P4 = 3 Nt.P, + Nt^P, + 7Vt,P, + 1 , (10) 
Nt+l,Pr = iNt,p,, 

where we have used the equivalent relations Nt,p^ = Nt,p^ = Nt,p^ and Nt^p^ = Nt,p^ = A^t.Pg- With the initial 
condition 7V2,Pi = 4, N2,Pi = 2, and N2,Pj = 3, we can solve the recursive equation (fTU]) to obtain 



f = 2lo [-112 + 25 X (-2)* + 27 x 6*] , 

{ Nt,p, = [16 - 25 X (-2)* + 9 X 6*] , (11) 

[ Nt^Pr = = 4 [32 + 25 X (-2)* + 3x6*]. 

For a node v in network Ft+i, we are also interested in the smallest value of the shortest path length from v to any 
of the three peripheral hub nodes A, B, and C. We denote the shortest distance as which can be defined to be 

fy = ■min{a,b, c). (12) 

Let dt^p^ denote the sum of of all nodes belonging to class Pi in network Ft. Analogously, we can also define the 
quantities dt^p^, dt^p^, • • • , dt^Pj. Again by symmetry, we have dt^p^ = dt^p^ = dt.Pa, dt^p^ = dt,p^ = dt^Pg, and dt^p^^, 
dt,Pii dt,p^ can be written recursively as follows: 



{dt+i,p^ = 3dt,p^ + 4dt,p^ + dt.p^ , 

dt+i,p, = 3 dt.Pi + dt,P^ + dt.p, + 3 Nt.p, + 1 , (13) 
dt+i,Py = i{dt,Pi + Nt^pJ . 

Substituting Eq. (jlip into Eq. (|13p . and considering the initial condition (i2.P1 = 4, (i2,P4 = 2, and (i2.P7 = 6, 
Eq. (fT3|) is solved inductively 

dt.Pi 25^[2048 + 327 x 6* + 990tx 6* + 25(33 + 70i)2*e*'^*], 

dt,P, = [~2075(-2)* - 1024(-1)2* + 699 x 6* + 10< (-175 x (-2)* + 33 x 6*)] , (14) 

^*.-P7 = [4096+ 1329 X 6* + 330ix 6* + 25(359 + 210t)2*e"*]. 



D. Calculation of crossing distances 



Having obtained the quantities Nt^p- and dt^p^ [i = 1, 2, ■ • • , 7), we now begin to determine the crossing distance 
A('^, A('^, A('^, and X^jen* ^Aj expressed as a function of A^f,p. and dt.Pi- Here we only give the computation details 
of Aj'^, while the computing processes of A^'^, A^'^, and X^jen'' '^J^o ^'"^ similar. For convenience of computation, we 
use r^'' to denote the set of interior nodes belonging to class Pi in f/''''. Then A^'^ can be written as 



A^ — ^ ^ duv + ^ ^ dy^y + ^ ^ diiv + ^ ^ ^iiv 

VytA,X,Y v^A,X,Y vyiA,X,Y v^A,X,Y 

+ ^ duv + ^ duv + ^ rfui;- (15) 

■u^A,X,y i;7^/l,X,y t)^A,X,F 

The seven terms on the right-hand side of Eq. (fT5|) are represented consecutively as (5^ (i = 1, 2, • • • , 7). Next we will 
calculate the quantities SI- By symmetry, = S^, = Sf . Therefore, we need only to compute Sj, 6f, Sf, and Sj. 
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Firstly, we evaluate Sj. By definition, 

v^A,X,Y 



= ^ {duA + dAv)+ ^ {duA + d-AY + dvv) 

«er;'\i;er?'iUr?'*'ur?'''ur?-'' uer;'\i>erp 
+ ^ + dAx + dxv) 

= Nt,P, {Mt,P, + 3dt^p, + duP, + 2Nt,P, + A^t,pJ + dt,P, i3Nt,P, + SNt^P, + Nf,p,). (16) 
Proceeding similarly, we obtain 

^ Nt,P,{Mt,P, + Mt,P, + dt,P, + ANt,P, + 3iVt,p, + iVt,p,) + dt,pA?>Nt,P, + 3A^t,p, + iVj^p,), (17) 

-^t^ = ^t,P2(3dt,Pi + 3dt,p, + dt,P, + Nt.P,) + dt,p,(3iVt,p, + 3A^t,p, + Nt.P,), (18) 

<5f = Nt^P, {idt^p, + 3dt,p, + dt,p, + 2iVt,p, + A^t^pJ + dt^p, [iNt^p, + 3Nt,p, + Nt,p,), (19) 

and 

Sj = Nt,p,{3dt.P, + 3dt.p, + dt,P, + Nt,pJ + dt,P,{3Nt,P, + 3Nt,P, + Nt,P,). (20) 
With the obtained results for 61, we have 

A^' = 2(3dt,p, + 3dt,p, + dt,p,)(3A^t,p, + 3iVt,p, + Nt,P,) + Nt^p, {3Nt^p, + 3Nt^p, + Nt,P,) 

+2{Nt,P, + Nt^p,){2Nt,P, + Nt,P,) + Nt,pANt,p, + Nt,P,) + {Nt,P,f. (21) 

Analogously, we find 

= 2(3dt,p, + 3dt,p, + dt,p,)(3iVt,p, + 3iVt,p, + Nt,P,) + 2{Nt^p,f + 2NtM^Nt,P, + 3iVt,p, + Nt,P,) 

+NtM^Nt,P, + m,p, + Nt^Pr) + iNt^p^ + 2Nt^p, + Nt,P,){2Nt^P, + Nt,P,), (22) 

A^^ = 2(3dt,p, + 3dt,p, + dt,P,){mt,P, + 3Nt,P, + Nt,P,) + {m,p^ + m,p, + Nt^p,? + 3(iV^,pJ^ (23) 

and 

dAj = iMt,P, + Mt,P, + dt,Pr) + i3Nt,p, + 3Nt,P, + Nt,P,) + Nt,P,. (24) 

Substituting Eqs. (pi]) . ([^^ . (I^S)) . and into Eq. ([7|), we the final expression for cross distances At, 

At = ^^i^ [ - 17920 + 95040 x 6* + 160380 x 6^* + 50 x 2* (-32 + 27 x 6*)e*^* + 44550i x 6^* 

+ e2"*(6875 X 4* - 82944 x 6* + 8019 x 6^* + 26730i x 6^*)] . (25) 

E. Rigorous result of average distance 

With the above-obtained results and recursion relations, we now readily calculate the sum of the shortest path 
lengths between all pairs of nodes. Inserting Eq. ([25|) into Eq. ^ and using the initial condition D2 — 555, Eq. ^ 
is solved inductively. 



192000 



266240 - 34375 x 4* + 2000 x (-2)* + 225264 x 6* - 750(-12)* 

+ 27621 X 6^* + 20160t x 6* + 23760t x 6^*1 . (26) 
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1 10 100 1000 10000 100000 1000000 1E7 

FIG. 6: Average distance dt versus network order Nt on a semi- logarithmic scale. The solid line serves as guide to the eye. 



Substituting Eq. ([26| into Eq. ([T]) yields the exactly analytic expression for average distance 

- [266240 - 34375 x 4* + 2000 x (-2)* + 225264 x 6* 

750(-12)* + 27621 x 6^* + 20160i x 6* + 23760t x 6^* 



34560(84+ 57 x 6* + 9 x 62*) 



(27) 



In the large t limit, dt ~ t, while the network order Nt ^ 6* which is obvious from Eq. Thus, the average 

distance grows logarithmically with increasing order of the network. This scaling is consistent with the speculation 
in [23] based on computer simulations. We have also checked our analytic result provide by Eq. ((27|) against numerical 
calculations for different network order up to t = 8 which corresponds to Ns — 1007772. In all the cases we obtain a 
complete agreement between our theoretical formula and the results of numerical investigation, see figure [SI 

Recently, it has been suggested that for random scale-free networks with degree exponent 7^ < 3 and network order 
N, their average distance d{N) behaves as a double logarithmic scaling with N: d{N) ^ In In A'' [1, However, for 
deterministic Sierpinsiki network, in despite of the fact that its degree exponent 7^, = 2 + |^ < 3, its average distance 
scales as a logarithmic scaling with network order, showing a obvious difference from that of the stochastic scale-free 
counterparts. 



IV. CONCLUSION 



Average distance plays an important role in the characterization of the internal structure of a network, and has 
a profound impact on a variety of dynamical processes on the network. In this article, we have obtained rigorously 
the solution for the average distance of a deterministic Sicrpinski network. We have explicitly shown that in the 
limit of infinite network order, the average distance of Sierpinski network scales logarithmically with the number of 
network nodes, verifying our previously suggested scaling obtained through simulations [2^ . Our findings display 
that the scaling of the average distance for deterministic Sierpinski network is strikingly distinct from the counterpart 
of stochastic scale- free networks [1, • This disparity of the scaling for average distance between the deterministic 
Sierpinsk network and random scale-free networks is worth studying in future. 
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